A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

نویسندگان

  • Ansari-Asl , Karim Ph.D. in Biomedical Engineering, Associate Professor, Electrical Engineering Dept., Faculty of Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran
  • Einipour, Amin Ph.D. Student in Computer Engineering, Computer Engineering Dept., Faculty of Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran
  • Mosleh, Mohammad Ph.D. in Computer Engineering, Assistant Professor, Computer Engineering Dept., Faculty of Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran
چکیده مقاله:

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. However, the results of the proposed methods mainly depend on the shape of the cell populations and the dimensions of the data. Therefore, it is very important to develop a method that can identify cell populations regardless of these obstacles. Method: In the proposed method, which was a library method, at first, the number of clusters (cell populations) was estimated. Estimating the number of clusters is important because in the real world, basic information such as the number and type of cell populations is not available. Thereafter, using a graph-based Gaussian kernel, while reducing the dimensions of the problem, the cell populations were identified by means of the kmeans++ clustering. Results: The results of the implementation showed that the proposed method can achieve an acceptable improvement compared to other machine learning methods presented in this regard. For example, for the ARI criterion, values of 100, 93.47 and 84.69 were obtained for Kolod, Buettner, and Usoskin single-cell data sets, respectively. Conclusion: The proposed method can cluster and thus identify cell populations with high accuracy and quality without having any basic information about the number and type of cell populations, regardless of the high dimensions of the problem.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A sparse differential clustering algorithm for tracing cell type changes via single-cell RNA-sequencing data

Cell types in cell populations change as the condition changes: some cell types die out, new cell types may emerge and surviving cell types evolve to adapt to the new condition. Using single-cell RNA-sequencing data that measure the gene expression of cells before and after the condition change, we propose an algorithm, SparseDC, which identifies cell types, traces their changes across conditio...

متن کامل

I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing

Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...

متن کامل

OEFinder: a user interface to identify and visualize ordering effects in single-cell RNA-seq data

UNLABELLED A recent article identified an artifact in multiple single-cell RNA-seq (scRNA-seq) datasets generated by the Fluidigm C1 platform. Specifically, Leng et al. showed significantly increased gene expression in cells captured from sites with small or large plate output IDs. We refer to this artifact as an ordering effect (OE). Including OE genes in downstream analyses could lead to bias...

متن کامل

Single-cell RNA-sequencing of the brain

Single-cell RNA-sequencing (scRNA-seq) is revolutionizing our understanding of the genomic, transcriptomic and epigenomic landscapes of cells within organs. The mammalian brain is composed of a complex network of millions to billions of diverse cells with either highly specialized functions or support functions. With scRNA-seq it is possible to comprehensively dissect the cellular heterogeneity...

متن کامل

Mining small RNA sequencing data: a new approach to identify small nucleolar RNAs in Arabidopsis

Small nucleolar RNAs (snoRNAs) are noncoding RNAs that direct 2'-O-methylation or pseudouridylation on ribosomal RNAs or spliceosomal small nuclear RNAs. These modifications are needed to modulate the activity of ribosomes and spliceosomes. A comprehensive repertoire of snoRNAs is needed to expand the knowledge of these modifications. The sequences corresponding to snoRNAs in 18-26-nt small RNA...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 7  شماره 1

صفحات  60- 72

تاریخ انتشار 2020-06

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

کلمات کلیدی برای این مقاله ارائه نشده است

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023